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We study a robust utility maximization problem with non entropic penalty term. 
We consider two types of Penalties. The first one is the f-divergence penalty studied 
in the general framework of a continuous filtration. The second called consistent time 
Oh penalty studied in the context of a Brownian filtration. These Penalties generalizes 

the relative entropy. We prove in the two cases that there exists a unique optimal 
. probability measure solution of the robust problem, which is equivalent to the historical 

probability. In the case of a consistent time penalty, we characterize the dynamic value 
process of our stochastic control problem as the unique solution of a quadratic backward 
stochastic differential equation. 
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H 

In the literature connected with the utility maximization problem, the optimality criterion 
is based on a classical expected utility functional of Von Neumann- Morgenstern form, which 
requires the choice of a single probabilistic model P. In reality, the choice of P is subject 
to model uncertainty. Schmeidler [16] and Gilboa and Schmeidler [6] proposed the use of 
robust utility functionals of the form 

X^ME Q [U(X)\ (1.1) 

where Q is a set of prior probability measures. Later, Maccheroni, Marinacci and Rustichini 
[13] suggested to model investor preferences by robust utility functionals of the form 

X^mfE Q [U(X) +1 (Q)}- (1-2) 
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The most popular choice for the penalty function is the entropic penalty function j(Q) = 
f3H(Q\P) for a constant f3 > and a reference probability measure P; see, e.g., Hansen 
and Sargent [9]. A general class of the penalty function 7 was developed by Schied by using 
duality methods. In this article, we are interested in a control problem of type (jl.2p with 
the general penalty term by using stochastic control approach as in Bordigoni, Matoussi 
and Schweizer [8]. Specifically we are trying to solve the following problem 

mf Eq[U ,t + PK ,t(Q)] (1.3) 

where 

U & tT S s s U s ds + aS T U T 

with : 

• a, a. are two parameters in [0, +00) and (3 G (0, +00). 

• (Ut)o<t<T a progressively measurable process. 

• Ut a random variable J^-measurable. 

• S s is the discounting process defined by: 

Sf ■= exp(- I 5 s ds); < t < T 
Jo 

where (5t)o<t<T is a progressively measurable process. 

• HtT{Q) is a penalty term which is written as a sum of a penalty rate and a final penalty. 
Note that the cost functional: 

c(w,Q) :=U^ T + (3TZ^ T (Q) (1.4) 

consists of two terms. The first is a Q-expected discounted utility with discount rate 8, 
utility rate U s at time s and terminal utility Ut at time T. Usually, U s comes from con- 
sumption and U T is related to the terminal wealth. The second term, which depends only on 
Q, is a penalty term which can be interpreted as being a kind of "distance" between Q and 
the historical probability P. In the presence of uncertainty or model ambiguity, sometimes 
also called the Knight uncertainty, the economic agent does not know the probability law 
governing the markets. The economic agent has a certain conjecture about the position of 
the true probability distribution Q,but not with total confidence. So he regards it as being 
much plausible than other probabilities. This plausibility can be measured using 7^o,t(<3)- 
The agent penalizes each sight probabilistic possible Q in terms of penalty 7£o,t(Q) and 
adopts an approach of the worst case by evaluating the profit of a given financial position. 
The role of proportionality parameter f3 is to measure the degree of confidence of the de- 
cision maker in the reference probability P, or, in other words, the concern for the model 
erroneous specification. Higher value of (3 correspond to more confidence. 
In this paper we studied two classes of penalties. The first class is the /-divergence penalty 
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introduced by Cizar [2] given in our framework by: 




(1.5) 



where / is a convex function. In this case, the set Q consists of all models Q abso- 
lutely continuous with respect to P whose density process (with respect to P) satisfies: 



The second called consistent time penalty studied in the context of a Brownian filtration 
generated by a Brownian motion iWt)o<t<T- 



where h is a convex function and the density process of Q v with respect to P can be written: 



In this case, the set Q is formed by all models Q v absolutely continuous with respect to 
P such that Eqj?[ / h(rj s )ds] < +oo. Using HJB equation methods, Schied [TS] studied 



the same problem when 5 is constant and the process rj takes values in a compact convex 
set in M. 2 . The paper is organized as follows. In the second section we study the robust 
utility problem where the penalty is modeled by the /-divergence. We prove the existence 
of a unique probability measure Q* equivalent to P. The third section is devoted to the 
class of consistent time penalty. We also prove the existence of unique probability measure 
equivalent to P. We characterize the value process as the unique solution of a quadratic 
BSDE. 

2 Class of f-divergence penalty 
2.1 The setting 

This section gives a precise formulation of our optimization problem and introduces a num- 
ber of notations for later use. We start with a filtered probability space (Q, J-, F, P) over a 
finite time horizon T € (0, +oo). 

The filtration F = (J-t)o<t<T satisfies the usual conditions of right-continuity and P- 
completeness. 

For any probability measure Q<Pon Tt, the density process of Q with respect to P is 
the RCLL P-martingale Z Q = (zf) < t < T with 



E P [f(Z^)} < +oo. 




(1.6) 
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Since is closed on the right by zf = ^ \jr t , we can identify Z® with Q. 
For all Q <C P on Tt, we define the penalty term by 



TZ S 0T := [ 5 s S 5 J-^^ds + Si^^-; for all < t < T 
Jo Z? Z® ~ ~ 

where / : [0, +oo) h->- R is continuous, strictly convex and satisfies the following assumptions: 
(H.l) /(l) = 0. 

(H.2) There is a constant k G M+ such that f(x) > —k, for all x G (0, +oo). 
(H.3) lim ^M = +oo. 

ih>+oo x 

The basic goal is to 

minimize the functional Q i-s- T(Q) := Eq[c(., Q)] (2.1) 
over a suitable class of probability measures Q <C P on Tt ■ 

Definition 2.1 For a convex function if we define the following functional spaces: 
is the space of all Tt measurable random variables X with 

E P [<p{i\X\)\ < oo for all 7 > 0, 

Dq is the space of all progressively measurable processes X = (X t ) Q<t<T with 

Ep [ip (7 ess sup 0<t<T |Xi|)] < 00 for all 7 > 0, 

Df is the space of all progressively measurable processes X = (X t ) 0<t<T such that 

<p(-yj \X s \ds) 



Ep 



< 00 for all 7 > 0. 



Definition 2.2 For any probability measures Q on (Q,T), we define the f- divergence of Q 
with respect to P by: 



d(Q\P) 



E H/(§Lft)] ifQ<.PonT T 
+00 otherwise 



If f{x) = xmx, then d{Q\P) is called relative entropy and is denoted by H(Q\P). 

We denote by Qf the space of all probability measures Q on (Q,T) with Q <^ P on Tt, 

Q = P on Tq and d(Q\P) < +00. The set Q e j is defined as follows 

Q} ■= {Q G Qf\Q~P on T T }. 
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We define the conjugate function of / on M. + by: 

f* (x) = sup (xy-f(y)). (2.2) 

f* is a convex function, non decreasing, non negative and satisfies: 

xy < f*(x) + f(y), for all x£K + and y > (2.3) 

and also 

xy < ~[f*hx) + f(y)], for all x G M+, 7 > and y > 0. (2.4) 

7 

For a precise formulation of (|2.1|) . we now assume: 
(Al) 5 is positive and bounded by . 

(A2) The process U belongs to d( and the random variable Ut is in IT 
Remark 2.1 1- the assumption f(x) > —k implies that : 

I fix) |< f(x) + 2k Vx > 0. (2.5) 

Indeed we have 

I /O) I - I I 

<ll /(x) I - 1 -« II 

<| /(X) + K |= /(x) + K. 

/n i/ie case of entropic penalty, we have f(x) = xln(x) and then f*(x) = exp(x — 1). 
As in Bordigoni, Matoussi and Schweizer f^/, the integrability conditions are formu- 
lated as 



E P 



exp(A I \U(s)\ds) < +00 and Ep [exp(A|C/T|)] < +00 for all A > 0. 
^0 J 



2.2 Existence of optimal probability measure 

The main result of this section is to prove that the problem ()2. 1 j) has a unique solution 
Q* € Qf. Under some additional assumptions, we prove that Q* is equivalent to P. This 
is proved for a general filtration F. We begin this section by establishing some estimates 
for later use. 

Proposition 2.1 Under (Al)-(A2),for all Q € Q; we have 

1. c(.,Q)GL 1 (Q) 

2. T(Q) < C(l + d(Q\P)) for some a constant C G (0, +00) which depends only on 
q, a, /3, S, T, U, U. 
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In particular T(Q) is well-defined and finite for every Q G Qf 
Proof. 

1. To show that c(.,Q) G L^iQ) is equivalent to prove that Z®c(.,Q) G L l (P). We set 
R := a J T \U{s)\ds + a|f7|. We have 



\Z?c(.,Q)\ <Z®R+ WSW^ Z% [ 

Jo 



f 




/o 


z? 



ds + 



f(Z, 



r ■ 



By the inequality $n$, we have Z$R < f(Z®) + f*(R). Under Assumption (A2), 
the variable random f*(R) is in L 1 (P). 

f(Z$) < f(Z?) + 2k. Since E P [f(z9)] < +oc, then 



From Remark 12.11 we have 



Fubini Theorem's, we have 



f 


f{Z?) 


lo 


z? 



ds G L l (P). By Tonelli- 



E P \Z\ 



f(Z c s 



ds] = / E P [Z^ 
Jo 



f(Z?) 



ds 



Jensen's inequality provides that 
f{Zf) = f (e p 



Zrp \ J~ 5 



< Ei 



f(Z?) \K 



f(z?) 



\ds= f E P [\f{Zf)\]ds. 
Jo 



By taking the expectation under P, we obtain 



Ep (/(£?)) < Ep 



f Z 



Consequently, 



E P [\f(zQ)\] <Ep \f(z 



+ 2k, 



f 


f{Z?) 


lo 


z? 



(2.6) 

(2.7) 
ds G L X (P). 



2. From the definition of T, we have 



r(Q) < Ep 



Ji -ft 



/3Ei 



l*llc 



By the inequality (|2.7p . we have 



E, 



|/(2?)|<fe + |/(2#) 



< 



'0 

r^Ep 



E P [|/(Z«)|] & + Ep [|/(z£ 



/ [Z 



+ 2k +Ep 



/ [Z 



+ 2k, 



and consequently, 

r(Q) < Ep [f*(R)} + 2« i S(||*|| 00 T + 1) + (1 + T + /3)d(Q|P). 
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We define the constant C by: 

C := max (E P [f*(R)} + 2k/3(\\5\\ 00 T + 1), (1 + (3 M^T + /3)) 
From assumptions (A1)-(A2), C is finite, positive and answers the question. 

□ 

We will need more precise estimation of T(Q). 

Proposition 2.2 There exists a positive constant K which depends only on a, a, f3, S, T, U, U 
such that 

d(Q\P) < K(1 + T(Q)). 

In particular inf T(Q) > —oo. 

Q£Q f 



Proof. From the Bayes formula, we have: 

Eg[ f 5 s S s J-^ds\T T ] = -^E P [ f 5 s S 5 s f{Zf)ds\F T ] > -^Tk \\ 5 
Jo Z7 Zt Jo Z? 

In the same way, by using exp(— T \\ 5 ||oo) < Sf. < 1, we get: 



E Q [S S T ^-\T T ] = ^E P [S 5 T f{Z$)\F T ] 

= ^E P [SUf(Z?)-K + n]\T T ] 
Z T 

> -^(- K + e- T ^°°( K + E P [f(Z®)\T T }) 
Z T 

> J_(_ K + e -T||5||oc Ep[/(z Q ) |^ T]) 
Z? 

We set R T := a \U s \ds + q;|C7t| and R = Rq. By using < S s < 1, and Bayes formula, 
we have 

E Q [Ul T \F T ) > -E Q [R\T T ] 

= — \~e p [z3r\t t ]. 
z? 

By using (|2.4p and since /* is non decreasing, we obtain: 

E P [Z$R\F T ] < ±E P \f(Z$) + r{iR)\T T ] 

< -Ep[f(Z®)\T T ] + -M P [r( 7 R)\T T }. 

7 7 

Thus, we have: 



E Q [c(.,Q)\F T ] > -^TK\\6\\^+^(-K + e- T ^Ep[f(Z^)\T T ]) 



1 



-Ep[f(Z^)\T T } + -E P [f*( 1 a ( T \U s \ds + 1 a\U T \)\T T 
7 7 Jo 



(2.8) 
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By choosing r = and taking the expectation under Q, we obtain: 
r(Q) > - /3T«||*||oo + /3[-k + e- T ^"°Ep[f(Z®)}}- 



(-E P [/(Z?)] + -E P [/*0 / |*7 s |ds + 7 «|C/t|)]) 
7 7 Jo 

= -^(TII^IU + l) + d(Q|P)[/3e 



7 



1 ' T 



E p [/*( 7 q / \U a \ds + >ya\U T \)]). 
7 Jo 

By choosing 7 large enough, there exists ij > such that /Je -7 "" 15 " 00 — ^ > rj. We set 

ff:= imax(l, ^(T^U + l) + -E P [f*(ja [ \U s \ds + ja\U T \)]). 
V 7 Jo 

Under the assumptions (A1)-(A2), K is finite and so the proof of the proposition is 
achieved. □ 
The following lemma is useful to show the existence of Q* which realizes the infimum of 

Q ^ r(Q) 

Lemma 2.1 For all 7 > and all A G Tt we have : 

Eq[\U$ t \1 a ] < -(d(Q\P)+K) + -E P [f* ( 7 q / \U s \ds + 7 a\U T \)lA}- (2.10) 
7 7 Jo 

Proof. From the definition of Uq t and using inequality (j2.3|) , we have 

Z$\l4 T \l A <Z$(a [ \U s \ds + ja\U T \)lA 
Jo 

< ~[f(Z?) + /*(7« / I + ia\U T \)]l A . 
7 Jo 

Using Assumption (H2), we obtain 

rp 

^|Z^ T |U<-[/(^) + « + r(7« / |^Ms + 7a|t7 T |)]l^ 
7 Jo 

<i[/(Z«) + K] + i[/*(7a / |C/ s |d S + 7 a|C7 T |)]l A . 
7 7 Jo 



The result follows by taking the expectation under P. □ 
The following theorem shows the existence of unique probability measure solution of opti- 
mization problem (|2,ip 



Theorem 2.1 Under (A1)-(A2), there exists a unique Q* G Qf which minimizes Q h4 
T(Q) over all Q e 2/ . 

Proof. 
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1. Qh> r(Q) is strictly convex; hence Q* must be unique if it exists. 

2. Let (Q n ) n eN be a minimizing sequence in Qf i.e. 

\ lim T(Q n ) = inf r(Q) > -oo, 

n->+oo Q6Q/ 

and we denote by Z n = Z®" the corresponding density processes. 
Since each Zj, > 0, it follows from Komlds' theorem that there exists a sequence 
(ZJf) n( zji with Z% € conv(Z^, Z^, + , ...) for each n € N and such that (^) converges 
P-a.s. to some random variable Z^P which is nonnegative but may take the value 
+oo. Because Q/ is convex, each ZJj^ is again associated to some Q n 6 Q/. We claim 
that this also holds for , i.e., that := Z^dP defines a probability measure 

Q°° € Q/. To see this, note first that we have 

T(Q n ) < sup T(Q m ) = T(Q n ) < r(Q 1 ), (2.11) 

m>n 

because Q !->■ r(Q) is convex and n i-> r(Q ra ) is decreasing. Hence Proposition 12.21 
yields 

supE P [/(Z n )] = supd(Q n \P) < K(l + supT(Q n )) 

neN neN neN ^ yft 

< K(l + supT(Q n )) < Kil + TiQ 1 )) < +oo. 

neN 

From Assumption (H3) and using de la Vallee-Poussin's criterion, we obtain the P- 
uniformly integrability of (Z^) n eN and therefore {Z T ) n ^ converges in L 1 (P). This 
implies that ~Ep[ZS?] = lim E P [Zj] = 1 and so Q°° is a probability measure and 

n— ¥+oo 

qco ^ p on -p T _ Because / is bounded from below by k, Fatou's lemma and 
inequality ()2. 12|) yield 

d(Q°°\P) = Ep[f(Z2?)] < liminf E P [f(ZZ)] < +oo. (2.13) 

n— >+oo 

Finally, we also have Q°° = P on J-q. In fact, [Zj) converge to Z%? strongly in L l (P), 
hence also weakly in L X (P) and so we have for every A E Tq : 

Q°°[A] = E P [Zfl A ] = lim E P [Z T 1 A ] = lim Q n [A] = P[A}. 

n— >+oo 71— M-oo 

The last equality holds since Q n (^4) = P(A) for all n G N and A £ J-q. This shows 
that Q°° G Q/. 

3. We now want to show that Q* := Q°° attains the infimum of Q i— >■ T(Q) on Qj. 

Let Z°° be the density process of Q°° with respect to P. Because we know that {ZTf) 
converges to Z°° in L X (P), the Doob's maximal inequality 

P[ sup | Z t °° - |> e] < -E P [\ Zf - Zip |] 

0<t<T e 
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implies that ( sup | Z t °° — Z t n |) n gN converges to in P-probability. 

0<t<T 

By passing to a subsequence that we still denote by (Z n ) ne pj, we may thus assume 
that (Z n ) converges to Z°° uniformly in t with P-probability 1. This implies that 
ZZc(., Q n ) — > ZS?cL, Q°°)P-a.s. and in more detail with 

n— >+oo 

Y? := Z%4 tT , Y? := f3( f 5 s S & J{Z™)ds + S 5 T f(Z%)) = /3< T (Q n ) 

Jo 

for n G N U {+00} that 

lim K n = Y°°P - a.s. for i = 1, 2. 

n— >+oo 

Since Y^™ is bounded from below, uniformly in n and w, Fatou's lemma yields 

Ep[y 2 °°] < liminfEp[F 2 n l. (2.14) 

We prove below that we have 

EpK°°] < liminfEppT 1 ]. (2.15) 

n— s>oo 

Plugging (|2TTi|) and (12TT51) into PTE]) , we obtain 

r(Q°°) = Ep^ 00 + y 2 °°] < liminf T(Q n ) < liminf T(Q n ) < inf T(Q) 

n—>oo Q(zQf 

which proves that Q°° is indeed optimal. 

It now remains to show that EpPK 00 ] < lim inf Ep [Y, n ] . 

n— s-oo 

We set for m G N; R m := Wq )T 1^ w « >_ m j. Thus for all n G N U {+00}; 

Yi = ZjUq T = Z^R-m + ZjIIq T 1^ u & T< _ m } 
Since R m > — m and Ep[Z^;] = 1, Fatou's lemma yields : 

E P [Z^l4 T ] < liminf E P [Z^ T }. 



Hence 



E P [yn = E P [Y?l {KT >_ m} ] + Ep[YTl KT< _ m} ] 

< liminf Ep[Z T i? m ] +Ep[Z^ T l {w , T< _ m} ] 



<hminfEp[y i n ]+2 sup E P [Z?|^ r |l { ^ m} ]. 

n ~ > °° nGNU{oo} °' T 



It remains to show that 

18 



lim sup Ep[Z£|^ jT |l { ^ } ] = 0. 
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However, Lemma 12.11 and Proposition ?? gives for any n G N U {oo}: 
E P [Z%\Uo !T \l {u s T< _ m} ] = ^Qn[\Uo jT \l {u s T< _ m} ] 

< i(d(Q n \P) + «) + ^E P [l KT< _ m} /*( 7 a ^ |C7.[ds + 7 «|f/T|)] 

< I (if(l + r(Q n )) + «) + E P [l {<T< _ m} r (7« ^ \Us\ds + ia\U T \)\). 
By using inequality ()2.1ip . we obtain for all 7 > 

sup E P [z^ T \i {u s } ] < I(K(i + r(g 1 )) +k) 

neNU{oo} 7 7 

T 

+ -Ep[i { ^ T <_ ro} /*(7« jf I^M* + 7«|£4l)D- 

By the dominated convergence theorem and using the integrability Assumption (A2) 
and since /* is non negative function, we have 

lim K P [l {u s } r( 7 a F \U s \ds + 1 a\U T \)} = 0. 

tn — 1 00 L u , 1 j j q 

Then, for all 7 > 

lim sup Ep[Z£|W tf , r |l {w * T< _ m} ]<-(^(l + r(Q 1 )) + K ). 

m->+oo neNu {oo} 1 0,T X 7 

By sending 7 to +00, we obtain the desired result. □ 

Our next result is to prove that the minimal measure Q* is equivalent to P. For this reason 
we need the following additional assumption: 

(A3):/ is differentiable on (0, +00) and /'(0) = lim f'(x) = —00. 

x->0+ 

We uses as in Bordigoni, Matoussi and Schweizer [8j an adaptation of an argument given 
by Frittelli [12], and we start with an auxiliary result. 



Lemma 2.2 Let Q° and Q 1 two elements in Qf with respective densities Z° and Z . Th 



en 



sup Ep 

0<t<T 



[f\Z° t ){Zl-Z^y] <d(Q l \P) + n 



Proof. We pose Z x = xZ 1 + (1 - x)Z° and for x G (0, 1] and fixe t G E, 

H(x,t):=kf(Zf)-f(Z?)). (2.16) 

x 

Since / is strictly convex, the function x 1— > H(x, t) is non decreasing and consequently : 

H(l,t) > lim -(/(Zf) - f(Z?)) = ±f(Zf) U =0 
x\o x dx 

= f{z«)(z\ - Z»). 
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From Assumption (H2), we obtain 

f'(Z?)(Zl - Z° t ) < H(l,t) = f{Z\) - 



(2.17) 

<f{Zl) + K 



Since j{Z\) + k > 0, then f'{Z^){Z} - Z?)+ < j[Z\) + k. Replacing in the inequality 
Z® by Z\ we obtain 

Ep[f(Z})]<E P [f(Zi)]=d(Q 1 \P) 
Taking the expectation under P in equation ()2. 17j) we obtain the desired result. □ 

Theorem 2.2 Under the Assumptions (A1)-(A2) , the optimal probability measure Q* is 
equivalent to P. 

Proof. 

1) As in the proof of Lemma 1231 we take Q Q ,Q X G Qf , we set Q x := xQ 1 + (1 - x)Q° 
for x £ (0; 1] and we denote by Z x the density process of Q x with respect to P. Then we 
obtain 

±(T(Q*)-r(Q°))=E P [(Zi-Z°)U s ] 

+ -PEp[ f $sS s s (f(Z x ) - f(Z° s ))ds + 4(/(^f ) " /(££))] 

z jo 

= E P [(Z£-Z£)W*] 

+ PEp[[ 5 s S s s H(x,s)ds + S^H(x,T)} 
Jo 

Since x i— >• -ff (x; s) is non decreasing and using Assumption (H2), we have 

H(x, s) < H(l, s) = f{Z l s ) - f(Z° s ) < f{Z]) + k, 

where the right hand of the last inequality is integrable. Hence we can use monotone 
convergence Theorem to deduce that 

Ar(Q-) | s=0 = E P [(4 - Z° T )U^ T ] + PE P [£ 5 s S s s f\Z Q s ){Zl - Z° s )ds 

+ s s T f(z° T )(z^-z° T )] ( 2 - 18 ) 

:=E P [11]+Ep[y 2 ]. 

Under Assumptions (A1)-(A2) and from inequality (|2.3p . we have Y\ E L 1 (P). As in the 
proof of Lemma 12.21 an d since x i— >• iif (x, s) is non decreasing we obtain 

*2 < / T S S S S S H(1, s)ds + S S T H(1,T) < [ T 5 s S s s (f(Zl) + K )ds + s£(/(4) + «) 
jo Jo 

which is P-integrable because Q 1 6 Qj . From Lemma 12.21 we deduce that Y 2 + £ ^{P) 
and so the right-hand side of ()2. 18j) is well-defined in [—00, +00). 
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2) Now take Q° = Q* and any Q 1 € Qf which is equivalent to P this is possible since Qf 
contains P. The optimality of Q* yields T(Q X ) — T(Q*) > for all x € (0; 1], hence also 

^r(Q X ) U=o> 0. (2.19) 
dx 

Therefore the right-hand side of (|2.18p is nonnegative which implies that I2 must be in 
L 1 (P). This allows us to rearrange terms and rewrite (|2.19p by using (|2.18p as 

/3E P [ f 5 s S 5 j'(Z* s )(Zl - Z* s )ds + S s T f{Z* T ){Zl - Z* T )\ > -E P [(Z£ - Z* T )U 5 ] (2.20) 
J 

But the right-hand side of (pT20D is > -00. So if we have Q* ¥ P, the set A := {Z£ = 0} 
satisfies P[A] > 0. Since Q 1 w P, we have Z\ > 0, and so f'{Z^){Z)p — Zj)~ = +00 on 
A. This gives [f'(Z^)(Z^ — Zj>)~] = 00 since Q 1 ~ P. But since we know from Lemma 12.21 
that [f(Z*)(Zi - Z* )+] € L l (P), we then conclude that E P [f(Z*)(Z^ - Z* )] = -00 and 
this gives a contradiction to (|2.20p . Therefore Q* ~ P. □ 

2.3 Bellman optimality principle 

In this section we establish the martingale optimality principle which is a direct consequence 
of Theorems 1.15 , 1.17 and 1.21 in El Karoui |11| . For this reason, we introduce some 
notations. Let S denote the set of all J-"-stopping times r with values in [0, T] and T> the 
space of all density processes Z Q with Q e Q f . We define 

V(Q, t) := {Z QI G V; Q = Q' on F T ] 

T(r,Q) :=E Q [c(.,Q)\T T ] 
and the minimal conditional cost at time r , 

J(t,Q) := Q - essinf T(t,Q'). 

q'gv(q,t) 

Then f|2. 1 j) can be reformulated to 

find inf r(Q)= inf E Q [c(., Q)] = E P [J(0; Q)] (2.21) 

by using the dynamic programming principle and the fact that Q = P on for every 

QeQf. 

Proposition 2.3 (1) The family {J(t,Q)\t G 5,Q £ Q/} is a submartingale system. 

(2) Q* G Qf is optimal 44> {J(t,Q*)\t € <S} is a martingale system. 

(3) For all Q € Qf there is an adapted RCLL process J® = (Jf)o<t<T which is a right 
closed Q -submartingale such that : J® = J(r,Q) Q-a.s for each stopping time r. 
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Proof. We should like to apply Theorems 1.15, 1.17, 1.21 in El Karoui [llj . These theorems 
requires that 

1. c>0or inf En'[\c(; Q')\] < oo for all r G S and Q G Q f , 

Q'eV{Q,t) v 1 

2. The space T> is compatible and stable under bifurcation, 

3. The cost functional is coherent. 
The proof is postponed in the appendix. 

Remark 2.2 In the proof of the Bellman Optimality principle, condition \2. 3\) ensures that 
J(r, Q) G ^(Q) for each r G S. In our case we prove such result directly (see Lemma \4A\ )- 

□ 

3 Class of Consistent time penalty 

In this section we assume that the filtration (Pt)o<t<T is generated by a <i-dimensional 
Brownian motion W. Then for every measure Q <C P on Tt there exists a d- dimensional 
predictable process (rjt)o<t<T such that ||?7t|| 2 <ii < +oo Q.a.s and the density process of 
Q with respect to P is a RCLL martingale = (Z\ )o<t<T given by: 

Zf =£{[ Vu dW u ) Q.p.s,Vt G [0,T]. (3.1) 
Jo 

where £(M) t = exp(M 4 — ^(M) t ) denotes the stochastic exponential of a continuous local 
martingale M. Barrieu and El Karoui [7], Delbaen et al.[3] have introduced a consistent 
time penalty given by: 

lt {Q) = E Q [J h{s,n s )ds\T t ] 

where h : [0, T] x Q x W 1 — > [0, +oo] be a convex, proper and lower semi-continuous function 
such that h(., .,0) = 0. 

In our case, we consider a deterministic function h defined on M. d . We also assume that 
there are two positive constants n\ and K2 satisfying: 

h(x) > Ki\\x\\ 2 — K2- 

The penalty term is defined by 

t4,t{Q V ) = / T ^S( f h( Vu )du)ds + || F h{ Vu )d U ,V0 <t<T. (3.2) 

for Q P on Pjf. As in the case of /-divergence penalty, we have to solve the following 
optimization problem: 

minimize the functional Q v ^ T(Q n ) := Eq„[c(., Q v )} (3.3) 

over an appropriate class of probability measures Q 71 <C P. 
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Definition 3.1 For all probability measure Q v on we define the penalty function: 

it(Q v ) 



^Q«[J h(rj s )ds\P t ] i/Q" <P onT T 



+00 otherwise 

We note Qj the space of all probability measures Q r] on (O, J-) such that Q v <C P on Tt 
and 7o (<7') < +00 and Qf := {Q" € Qf\Q « P onP T }. 

Remark 3.1 1- We note that Qf is non empty set because P € Qj . 

2- The particular case of h{x) = \\x^ corresponds to the entropic penalty. Indeed 



tf(Q"|P)=EQ„[log( 



dP n 

T 1 r T 

II, , 2 



E Q"[/ VudW u -- / \rj u \ du] 



si 2 jo 



Since (J r/ u dW u ) is a local martingale under P , then by the Girsanov theorem (J r] u dW u )- 
fo \Vu\ 2 du is a local martingale under Q v and so 

H(Q , i\P)=E Q v[[ rjudWu- f \r, u \ 2 du + \ ( \r, u \ 2 du] 
Jo Jo 1 Jo 

= E Q ,[i^ \r, u \ 2 du] = 7o (Q"). 



□ 



3- For a general function h we have for all Q v £ Qj, 



HmP)<-L 70 (QV) + l^. (3.4) 



Indeed: 

rT 1 



H(Q r >\P)=E Qv [±^ \ Va \ 2 d8]<E P [±-(J (h(\ Vs \) + K 2 )ds)} 

<E Qv [^-([ T (h(\ Vs \)ds) + 
Jo 



TK2 

2ki J 



(3.5) 



2k\ 2k\ 

In particular H{Q ri \P) is finite for all Q 11 £ Qj. □ 



To guarantee the well-posdness of the problem (|3.3p we need to replace the Assumption 
(A2) by (A'2): the cost process U belongs to L>^ xp and the terminal target Ut is in L exp 
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Remark 3.2 Under Assumption (A' 2), we have 

A f \U s \ds + p\U T \ G L exp , for all (A,/x) G R%. (3.6) 

JO 

Indeed, since x i— > exp(x) is convex we have 

E P [exp(A / |E/ s |ds + /x|i7 T |)] 
Jo 

= E P [exp(i x 2A f \U s \ds + i x 2//|C/ r |)] 
^ Jo ^ 

< E P [~ exp(2A / \U s \ds) + \ exp(2n\U T \)} 
1 Jo 2 

= -E P [exp(2A / |[/ s |ds)] + -E P [exp(2^|C7 T |)] < +oo. 
2 Jo 2 



3.1 Existence for an optimal model 

The main result of this section is to prove the existence of an unique probability Q v * that 
minimizes the functional Q n i— > T(Q ri ) in all probability Q v G Q c j. We begin this section by 
giving some estimates for T(Q V ) for all Q v G Qj. 

Proposition 3.1 Under assumption (Al)-(A '2), we have for all Q r] G Qj : 

1. c{.,QV)eL l (QV). 

2. T(Q ri ) < C(l+7o(Q r ')) for some positive constant C which depends only on a, a, (3, 5, T, U, U. 
In particular T(Q V ) is well defined and finite for all Q v G Qj. 

Proof. 

1. As in Proposition EZQ we have Z^R G ^(P) i.e R G L l (Q r >). In addition, 

T 5 S S 5 S { f h{r, u )du)ds + S S T [ T h{n s )ds\ 
n JO JO 



< ^ Halloo ( y o h{i lu )d u y s + J h{n s )ds (3.7) 

< (\\8\\ oa T+ l) j\(n s )ds G L^Q"). 
2. From inequality (|2.3p with /(x) = xlogx, we have: 

r(Q") < Ep[Z^i?] + f3E QV [ [ T 5 S S S S ( f h(r] u )du)ds + S S T f h(r] u )du} 

JO JO JO 

< Epilog Z^ + e^e^ + PiW 5 T + 1)E£ [ / h( Vu )} 

Jo 



< H{QV\P) + e- l E P [e R ] + /3(|| 5 |U T + l)jo(Q v ). 
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From inequality (|3.4p . we have 

IW) < + * Hoc T + l)h,(Q") + e- 1 !^] + ^1. 

We takes C := max(e- 1 J Bp[e' R ] + h /9(|| S IU T + 1)) which is finite, then 

the result follows. □ 

The following proposition gives a lower bound for our criterion T(Q V ) for all Q 11 G Qj. 

Proposition 3.2 Under the assumptions (A1)-(A'2), there exists a positive constant K 
such that for all Q v G Q/ 

7 o(Q")<#(l + lW)). 
In particular inf rfQ* 7 ) > — oo. 

Proof. For Q v G Q/, we denote by its density process. Since h takes values on [0, +oo], 
we have 

peqv [ r 5 S s s s ( r h^du^ + s s T r > ^ ^ r % u )<h 

Jo J o Jo Jo (3.8) 

> /3e-^ T l0 (Q^). 

Moreover, since < S 5 < 1, we have: 

Eq»[Wo,t] > -Eg, [22] = -E P [^i2]. (3.9) 
From inequality (12. 4p where f(x) = xlogx, and as a consequence f*(Xx) = e A:r_1 we have 

xy < \(ylny + e~ 1 e Xx ) for all (x, y, A) G R x R^ x R* (3.10) 
A 



We get: 



From inequality (|3.4p . we deduce that 

Ep[^U3 < — 7 o(Q r ') + + -r-^p[e XR \. (3-11) 
2Aki Z\k\ A 

From the definition of T(Q ri ), we have 

r(Q") = E p [Z t U ,t] + l3E Q v[ ^ 5 S S 5 S { f h(r] u )du)ds + 4 /" % U )<H 
From ([3T5 j) .([3l? ]) and (l3~TT]l . we obtain 

r(0 > /3e-ll 5 ll- r 7 o(Q") - ^7o(Q") - ^ - ^Ep[e^] 



A A A 



fT 
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By choosing A > large enough, there exists u > such that /3e I'" 5 !! 002 " — > fi. From 

Remark^ we have E P [e XR ] is finite. Then, by taking K := ± max(l, — — + ^r- E P [e XR }), 

'' '2Xh' X 

we obtain the desired result. □ 
By combining the previous Proposition and the inequality (|3.4p we obtain the following 
result. 

Corollary 3.1 Under the assumptions (A1)-(A'2), there exists a positive constant K' 
such that for all € Qj 

H(Q^\P)<K'{l + Y(Q^)). 

To prove the existence of the minimizer probability measure, we need the following technical 
estimate. 

Lemma 3.1 For all 7 > and any A € Tt we have: 

E Q ,[\U s 0T \l A ]<^^ + -^ + ^— + ^—E P [l A ex V {Xa \U s \ds + Xa\U T \)}. (3.12) 
ZXki Zak\ XX Jo 

Proof. From Remark 13.21 we have R € L exp . By using the inequality (|3.10p . we obtain 

Z^\U s 0T \l A <Z^{a f \U s \ds + a\U T \)l A 
Jo 

< \[Z^\n(Z^) + e - 1 exp(Aa f \U s \ds + Xa\U T \)]l A 
x Jo 

< \[Z^ln(Z^) + + ^— l A exp(Aa I \U s \ds + Xa\U T \). 
A A Jo 



By taking the expectation with respect to P and using the inequality (13. 5ft we get 

E Qn [\U^ T \l A ] < -— 10 (Q r >) + —^ + ^- + ^-E P [l A exp{Xa / \U s \ds + Xa\U T \)], 
ZXk\ ZXk\ XX Jo 

and so inequality (|3.12p is proved. □ 
Proposition 3.3 The functional Q v 1— > T(Q ri ) is convex. 
Proof. By the product derivatives formula, we have 

^(S s s ( f h( Vu )du)) = -5 S S 5 S f h(ri u )du + S s h( Vs ). 



ds 

By integrating between and T we get: 



S S S S S ( /" h{r, u )du)ds + 4 / T % u )d« = / T S 5 s h(r] s )ds. (3.13) 
Jo Jo Jo Jo 



Fix A G (0, 1) and Q' n and two distinct elements of Qf. 

rdQ 

Up 



Let Q = XQ V + (1 - A)Q' ? ' and L t = E P [^|.F t ]. Using Ito's formula, we get L t = £{q.W) t 



where (qt)o<t<T is defined by 

vty£? + (l- 
AL? + (1-A)L?' 



A77L? + (1 - X)ri'LP t , JT , r _ 

^^^T^fiar^-A^j <ft®<iPa.e. i 6 [0,T]. 
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From the definition of the penalty term in T, we have 



TZ 0tT (Q)=K Q / S S S S S ( h{q u )du)ds + S 5 T h(q u )du 



E. 



Q 



T 



S s s h( 



Jo 



Xr]L v s + (1 - X)jUl 
XL] + (1-X)V S ' {^+(i-a)l/>o} 



' s XyLl + jl-XWLl 
^ AL2 + (1 - A)^ j Wd 



, ds 



(1 - A)L!f ' i A ^+li-A)^ >o} 
where the second equality is deduced from (|3.13p . By the convexity of h we have: 



E, 



5 S S S ( / h(q u )du)ds + S T I h(q u )du 



XV 



o 



AL<? + (1 - A)L 



7%s) + 



Ep 

XKnv 



(XV s + (l-X)L*)S s s ( 

T 

L JO 



XVI + (1 - X)L 
XVI 



Y h (v's))l {XL , +{1 _ x)L f >0} ds 

(i - a)l/ 



S s s h( Vs )l s 



AXJ + (1-A)L? v "' ■ AL2 + (l-A) J Lr /t( ^ ))1{Ai " +(1 - A)i2 ' >0} 



*] + (1 - A)E oV [ / W,)!^.^ 



^{AL2 + (l-A)L? >0} 

Since we have Eq[Z^o,t] = AEqi7 [^o,t] + (1 — A)Eg^/ [Wo,t] > we deduce that 

r(Q) < Ar(Q") + (i-A)r(Q"'). 



□ 



The following theorem states the existence of a probability measure solution of the opti- 
mization problem (13 .3p . 



Theorem 3.1 Assume that (A1)-(A'2) are satisfied. Then there exists a probability mea- 
sure Q v * E Qj minimizing Q v i— > r(Q T? ) over a// E Q'j. 

Proof. 

1. Let (Q ,?n )ngN be a minimizing sequence of Q C j- ie: 

\ lim r(Q"")= inf r(Q"). 

We denote by Z n := Z^* 7 = £(J r] n dW) the corresponding density processes. 
Since each > 0, it follows from Komlos lemma there exists a sequence (Z^) nG jsj 
such that Zj, E conv(Z^, Z^ +1 , •••) for all n E N and (Z™) converges -P-a.s to a random 
variable Z^. 

Z^? is positive but may be infinite. As Qf is convex, each Z^ is associated with a 
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probability measure Q n G Qf. This also holds for Zj? ie that dQ 00 := Z°°dP defines 
a probability measure (J 00 € Qf. Indeed, we have first: 

r(Q n ) < sup r(Q" m ) = r(Q" n ) < r(Q" 1 ), (3.14) 

m>n 

where the first inequality holds since Q v h-> r(Q ,? ) is convex and n i— >■ r(Q' ? ™) 
is decreasing and the second inequality follow from the monotonicity property of 
(IW ))n- Therefore the corollary ^D} gives 

supE P [Z n ln(Z n )] = suptf(Q n |P) < + su P r(Q n )) < + T(Q^)). (3.15) 

neN nSN nSN 

Thus (ZS) ne ^ is P-uniformly integrable by the Vallee-Poussin's criterion and con- 
verges in L l (P). This implies that E P [ZS°] = lim E P [ZJf\\ = 1 so that Q°° be a 

n— >+oo 

probability measure and Q°° < P on P T . We define Zf 3 := P P [ZfP|Pt]. (Z t °°)t is a 
martingale. From the representation given by (13. lh there is an adapted process 
valued in R d satisfying / Q T ||T£°|| 2 di < +oo P.a.s and Zf> = S (jj »7?W a ). 

2. We now want to show that Q°° G Qj. 

Let Z°° the density process of Q 00 with respect to P. Since we know that (Zj<) 
converges to Z°° in L (P), the maximal Doob inequality 

P[ sup | Z t °° - Zf |> e] < -E P [| Z|° - Z% \] 

o<t<T e 

implies that ( sup | Z t °° — Z™ |) n( =N converges to in P-probability. 

0<t<T 

Going to a subsequence, still denoted by (Z n ) ne N 5 we can assume that ( sup | Z f °° — 

0<t<T 

Z" |)neN converges to P-a.s 

By the Burkholder Davis Gundy inequality there is a constant C such that 
E[{Z°° - Z n )\\ < CE[ sup | Zf - Z™ |]. 

0<t<T 

Let M™ := sup | Z^° — Z™ | and (r n ) a sequence of stopping time defined by 

0<s<t 

= f inf {t G [0, T[; M" > 1} if {t G [0, T[; M™ > 1} ^ 
1 T otherwise 

Since M™ is bounded by MJ5 A 1 then M™ converges almost surely to and by the 

dominated convergence theorem converges to in L X (P). Then, using Burkholder 

i 

Davis Gundy inequality (Z°° — Z n )r n converges to in L l (P) and a fortiori in prob- 
ability. 

As, (Z°° - Z n ) T = (Z°° - Z n ) Tn l {Tn=T} + (Z°° - Z n ) T l {Tn<T} , then for all e > 0, 

P((Z°° - Z") T > e ) < P«Z°° - Z«) r „l {Tn =T } > e) + P«Z°° - Z") t 1 {t „ <t} > e) 
<P((Z°°-Z n ) Tn >e) + P(T n <T) 
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From the convergence in probability of ((Z°°—Z n ) Tn ) n , wehave lim P((Z oc —Z n ) Tn > 

n— >+oo 

e) = 0. Since M n is an increasing process, we have 

P(r n <T) = P({3t G [0,T[ s.t M™ > 1}) = P({M% > 1}). 
Since M2 converges in probability to 0, we have P({M^ > 1}) — > 0. Then 

71— >-+0O 

lim P(r n < T) = 0, and consequently lim P({Z°° - Z n ) T > e) = .ie ((Z°° - 

n— >+oo n— »+oo 

Z n )T)n converges in probability to 0. We can extract a subsequence denoted also by 
Z n such that ({Z°° - Z n ) T ) n converges almost surely to 0. 
On the other hand we have 

i-T 



(Z™-Z n ) T =[ {Z^T-Ktufdu. 
Jo 



It follows that the processes Z n fj n converge in dt ® dP- measure to the process Z 00 ?) 00 . 
Since Z n — > Z°°dt®dP-&.e, we have ff 1 converges in dt <g) dP- measure to fj°°. Fatou's 
lemma and inequality (|3. 15|) gives: 



7o(0°°) =^p[Zf [ h(fj™)du] < liminf E P [Z£ / h(r%)du] < +oo. (3.16) 

Jo n->+oo ,/ 

This shows that Q°° € Q f . 

Now we will show that the probability Q°° is optimal. 

For n G NU{+oo}, let Y/ 1 := ZJjlA s and KJ 1 := (31Z s (Q n )) then lim Y> = Y^ 00 P-a.s 
for i = 1,2. As Y 2 n is bounded from below, uniformly in n and oj, the Fatou's lemma 
yields: 

EpfK 00 ] < hminf EpfYo 1 ]. (3.17) 
By adopting the same approach as in Theorem 13.11 we show that: 

Ep [Yi°°] < lim inf E P [Yf] . (3.18) 

n— loo 

Inequality ([3T7]), ([3T8]) and ff37T5]) provide that: 

r(Q°°) = Ep^ 00 + Y, 00 ] < liminf T(Q n ) < liminf T(Q n ) < inf T(Q). 

n— >oo n— >oo Q&Qf 

This proves that Q°° is indeed optimal. 



□ 

3.2 BSDE description for the dynamic value process 

In this section, we employ stochastic control techniques to study the dynamics of the value 
process denoted by V associated with the optimization problem (I3.3D . We prove that V 
is the only single solution of a quadratic backward stochastic differential equation. This 
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extends the work of Skiadas [19] , Schroder and Skiadas [T7] . 

We first introduce some notations that we use below. We Denote by S the set of all T 
stopping time r with values in [0, T], T>° the space of all processes r/ with G Q c j and T> c ' e 
the space of all processes rj with Q v G Q/ 6 - We define: 

V c (ti,t) := e P c , Q" = Q^ an [0,r]} 

r(r,Q") :=E Q „[c(,Q")|.F T ]. 

We note that r(0, Q 11 ) and r(Q ,? ) coincide. The minimal conditional cost at time t is 
defined by 

J{t,Q v ) ~Q v - essinf T(t,Q t ''). 

tj'e» c (f),T) 



Then the problem (13.3|) can be written as follows: 

give inf r(Q")= inf Eq„[c(., Q")] = E P [J(0, Q^)]. (3.19) 

Where the second equality is deduced since the dynamic programming principle holds and 
we have = P on J" for all Q v e Q c f . 

The following martingale optimality principle is a direct consequence of Theorems 1.15, 1.17 
and 1.21 in El KarouifTT]. For shake of completeness, we give the proof in the Appendix. 

Proposition 3.4 (1) The family {J(r,Q ?? )[r G S,Q V G Q c j} is a submartingale system. 

(2) Qp* G Qj is optimal 43- {J(r, Q' n * )\t G S} is a martingale system . 

(3) For all Q 11 G Qj there is an adapted RCLL process J? = J(t,Q v ) which is a right 
closed Q v -submartingale such that : J? = J(t, Q* 1 ) Q v -a.s for each stopping time r. 

In order to characterize the value process in terms of BSDE we need the following propo- 
sition. 

Proposition 3.5 Under (A1)-(A'2), we have 

inf r(Q") = inf r(Q"). 

Proof. Let G Q c f such that inf c Eq[c(., Q)] = E Qv * [c(., Q"*)] and A G [0,1), then 
AQ 77 * + (1 - A)P G Q^' e . Since h-> r(Q 7 ') is convex then 

T(XQ V * + (1 - A)P) < Ar(Q"*) + (1 - A)T(P) VA G [0, 1), 

which implies 



Consequently, we have 



limsupr(AQ'' + (1 - A)P) < T{Q^ ). 



mf_r(Q")>__inf ce r(Q"). 



Qi&Qj Q*ieQ c f ' e 

The converse inequality holds since Q c f e C Qj. □ 
We use later a strong order relation on the set of increasing processes defined by 
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Definition 3.2 Let A and B two increasing process. We say A X B if the process B — A 
is increasing. 

We already know from Theorem 13.11 that there exist an optimal model Q' n G Q c j. For each 
Q v G Qj' e and t £ 5, we define the value of the control problem started at time r 

V{r,Q n ) = Q v -ess inf V(t, Q^), 

where 

V(t, Q"') = E QV [Z< t | Jy] + 0V [<r(Q^)l^-]. 
We need to define the following space 

% p d = |(Zt)o<t<T^-P r ogressively measurable process valued in M rf s.i Ep[ J \Z u \ p du] < oo|. 

The following result characterizes the value process V as the unique solution of a BSDE 
with a quadratic generator and unbounded terminal condition. Precisely we have 

Theorem 3.2 Under the assumptions (A1)-(A , 2), the pair (V,Z) is the unique solution 
in L>o Xp xH P d ,p> 1, of the following BSDE: 

dY t = (5 t Y t - aU t + h*d-Z t ))dt - Z t dW t , 

P (3.20) 

Y T = clU' T . 
and Q* is equivalent to P. 

Proof. By using Bayes formula and the definition of lZ^. T (Q' n '), it is clear that V(t, Q* 1 ') 

depends only on the values of rf on (t, T] and therefore independent of Q v since n = rj on 

[0, t]. Thus we can also take the essinf under P w Q . From Proposition 13.51 we could take 

the infimum over the set Q c f ' e , which implies V(r, Q v ) = P — essinf V(t, ), for all 

1 " T]'eV c {r),T)nD c > e 

Q v G Q/ 6 - Since V(t,Q v ) is independent of Q v , we can denote V(t,Q v ) by V(t). 
We fix r/ G V{Q^,t). From the definition of n\ T {Q^) (see equation (133]) ). we have 

<t(Q"')= ( T 5 s S 5 s {f h{r,' u )du)ds + S 5 T [ T h(ri' u )du 
Jo Jo Jo 

= f T 5 S S 5 S ( f h( Vu )du)d s + 5* / T % u )d« + S*'Jl 5 TjT {Q r >'). 
Jo Jo Jo 

By comparing the definitions of V(r) = V(r, Q* 7 ) and J(t,Q v ), then we get for € Q^' 6 

JJ? = S T V T + a£ S s U s ds + P(f a 6 s S s(f o h (Vu)du)ds + S S T £ h{rj u )duj . (3.21) 

Arguing as above, the essinf for J(t,Q v ) could be taken under P ~ Q v . From the Propo- 
sition [331 Jt admits a RCLL version. From equality (j3.21 j) . we can choose an appropriate 
RCLL process V = (Vt)o<t<T such that 

V T = V(t) = V(t, Q 71 ), F.a.s for all r G S and Q v G Qf 
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and then we have for all Qp £ Q C j £ 

J? = S?V t +aJ S 5 s U s ds+p^J 5 S S 5 S (J h{r] u )du)ds+Sf J h(n s )ds^ dt®dP a.e, < t < T. 
° oo o 

Prom Remak f3.ll belongs to Qj' e - If we take r\ = 0, the probability measure Q° coin- 
cides with the historical probability measure P. Then, by the Proposition 13.41 J p is P- 
submartingale. From equation (|3.2ip . J° = S S V + a f SfU s ds and thus, by Ito's lemma, 
we deduce that V is a P-special semimartingale. We write its canonical decomposition as 
follows: ^ ^ 

V t = V - I q s dW s + [ K s ds. (3.23) 
Jo Jo 

For each G Q c f , we have Z 7 } = £(J Q rjdW). Plugging (I3T2"31) in to (I3T21"]) . we obtain 
dJ? = S s t (-q t dW t + K t dt) - 5 t S t V t dt + aS t U t dt + l3S 5 t h(7] t )dt. 



By the Girsanov theorem the process — j Q qtdWt + j qtVtdt is a local martingale under Q v 
and the dynamics of (J^)t is given by 

djy = S S t (-q t dW t + q t n t dt) + St(K t - qm + Ph( Vt ))dt - 5 t S 5 t V t dt + aS S t U t dt. 

J 11 is a Q v — submartingale and J v is Q v - martingale. Such properties holds if we choose 

Kt = 5Vt — aUt — essmf(—qtf]t + /3h(r]t)), where the essential infmum is taken in the sense 
v 

of strong order ^ . Then 

K t = 5V t - aU t + esssupta - /3% t )) = 8V t - aU t + ph*{\q t ). (3.24) 

v P 



This ess inf is reached for rf = {h')- l (±q t ). From (pT23l) and (pT24"D we deduce that 

(3.25) 



dV t = (StV t - aU t + Ph*{±q t ))dt - q t dW t 



y V T = a'U' T 

Moreover we have, (h*)'(qt) = rj* dt (g> dP a.s . By the convexity of h* , there exists a 
positive constant c such that |(/i*)'(x)| < c(\x\ + 1.) Then, we have 

[ T \4\ 2 dt = f T \{h*)'(q t )\ 2 dt < c 2 [ T (l + \qt\fdt < +oo, (3.26) 
Jo Jo Jo 

which mean P{^f - = 0} = P{ff \^\ 2 dt = oo} = 0. Hence ~ P. □ 

Remark 3.3 From Briand and Hu jll5f the equation \3. 20\) has a unique solution, because 
inequality 

h(x) > Ki\x\ 2 — Ki 
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implies 

h*(x) < 

2ki 



h*(x) < — \x\ 2 + K 2 



and hence the driver f of BSDE h3.2Cfy given by f(t, w, y, z) = 5ty — cdJt + (3h*(\z) satisfies: 

1. for all t E [0, T], for all y £ R, z i— > f(t,y,z) is convex; 

2. for all (t, z) € [0, T] x R d , 

V(y,y')eR 2 ;\f(t,y,z)-f(t,y',z)\ <\\ 6 ^ 

3. /or all (t, y, z) € [0, T] x R x R d ; 

|/(t,y,*)| = |&J/ - aU t + 2f)] <|| <5 ||oo \y\ + \a\\U t \ + T^M 2 + k 2 . 

Since the process \otUt\ £ D^ p and the terminale condition qXJt £ L exp , the existence of 
the BSDE solution is ensured. Uniqueness comes from the convexity of h* . 

3.3 A comparison with related results 

In the case of the entropic penalty, which corresponds to h(x) = ^\x\ 2 , the value process is 
described through the backward stochastic differential equation: 

dY t = (5 t Y t - aU t + ^\Z\ 2 t )dt - Z t dWt 

2 P . (3.27) 

Y T = a'U' T 

These results are obtained by Schroder and Skiadas in |19t ITT] where a 1 = 0. In the context 
of a dynamic concave utility, Delbaen, Hu and Bao [5] treated the case 5 = and £ = a'U' 
is bounded and j3 = 1. In this special case the existence of an optimal probability is a direct 
consequence of Dunford-Pettis theorem and James theorem shown in Jouini-Schachermayer- 
Touzi's work [3]. Delbaen et al. showed that the dynamic concave utility 

Y t = ess inf 22 [£ + / h(r) u )du\ Tf] 
satisfies the following BSDE: 



dY t = h*(Z t )dt-Z t dW t 



(3.28) 



□ 
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4 Appendix 

4.1 Proof of the Bellman optimal principle 
4.1.1 f-divergence case 

Lemma 4.1 For all r G S and all Q G Qf, the random variable J(t,Q) belongs L l {Q) 
Proof. By definition 

J(r, Q) < T(r, Q) <M Q [\c(.,Q)\\T T ], 

and consequently 

(J(t,Q)) + <E Q [\c(.,Q)\\T T ] 

is Q— integrable according to the Proposition 12. 11 
Let us show that (J(t,Q))~ is Q— integrable. We fix Z®' G V(Q,t). 

In the inequality (j2.8|) . choosing 7 > such that /3e^ _T " <5 "°°' 1 = 0, then we obtain 

7 

T(t,Q') > -B := -pK-^(T \\ 5 lloo +1) - -^\-Ep[r( 7 a f \U{s)\ds + -ya\U T \)\ T T ] 

(4.1) 

Since the random variable B is nonnegative and does not depend on Q' , we conclude that 
J(t, Q) > -B. Since f*(x) > for all x > 0, we have 



J(t,Q)~ < B := Pk^q(T II S ||oo+l) + ^[-E F [/*( 7 a / |C/( S )|ds + 7 a'|^|)| JV] 

(4.2) 

Finally, -B £ ^(Q) because the assumption (A1)-(A2). □ 

Lemma 4.2 TTie space T> is compatible and stable under bifurcation and the cost functional 
c is coherent. 

Proof. 1-We first prove that T> is compatible 

Take Z Q G V,t G S and Z Q ' G V(Q,r). Then , by the definition of V(Q,t) we have 

= QV T 

2- Take again Z Q £ V,t £ S,A £ J=~ T and Z Q ' G T>(Q,r). We have to check that 
Z q \t a \Z q ' := Z Q 'l A + Z Q l A c is still in V. 

To this end, it is enough to show that Z®\t a \Z® is a J-~-martingale and that {Z^\t a \Z^ )t 
defines a probability measure in Qf. 

Let us start proving that ZQ\t a \ZQ' is a martingale. Since our time horizon T is finite, we 
have to prove that 

E P [(zQ\T A \zQ') T \F t ] = {zP\r A \zF)t 
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Observing that 1{ T <<} + l{r>t} = 1> we have 

V P [(Z% A \zV) T \T t ] = Ep[Z$'l A + z£l A c(l {T < t} + l {T>t} )|Ji] 

= E P [z!j!'l An{T < t} \T t } + E P [Z^'l An{T>t} \ T t ] 
+ E P [Z« 1 An{T < t} + E P [Z? l An {r>t} 1^] 

Since 4n{r<t} and ,4 C n {r < i} are in T t , while i n {t > t} and A c n {r > i} are in 
J> , we have 

Ep[(ZO|rA|ZO') r |Ji] 

= / An{r < t} Ep[Z«'|j;] +Ep[Ep[Z«'l An{r>t} |JVv t ]|-^] 
+ l A c n{T < t} Ep[Z®\T t } +E P [Ep[Z^l Acn{T>t} \T TVt }\T t ] 

= 1 An{r<t} z t + p [ z ?vt 1 An{r>t}\J : 't\ + l A c n { T <t} z ? + P[ z T vt 1 A<=n{r>t} \Ft] 
= 1 An{r<t} z ? +Ep[Z^ l A n{ T >t}\Ft] + ^A c n{r<t} z ? +E P [Z®l A c ri { T>t y\J't} 

From the definition of T>(Q, r), we have Zr = Z? and so 

E P [{Z% A \zV) T \F t ] 

E P [Z?l An{T>t} \T t ] + l Acn{T < t] Z? + Ep[Z«/ Acn{r>t} |j;] 
= l An{r < t} zf +E P [Z?l {T>t} \T t ] + l Acn{r < t} Z« 

= ^An{T<t} Z t + z t ^{r>t} + l^nfr^t}^? ^ ^ 

= ^An{T<t} z ? + z t (!{r>t}nA + l{r>t}nA c ) + lA c n{r<<}^ 
= ^-Ar]{T<i} z ? + ^ l{r>t}nA + z ?^-{T>t}nA c + lA c n{r<t}^ 
= i A zf + 1 A Z® 
= {Z% A \Z®) t 

From the definition of Z q |t a |Z q ' , we have Z Q |r A |Z°' G L^fO,^) and so Z q \t a \Z q ' is a 
^-martingale which implies 

E P [{Z% A \ZV) T } = E P [Z^'l A + Z®l A c] = l A + 1 AC = 1. 

It remains to show that d(Q) < oo where the density of Q is given by Z®\t a \ZQ'. We have 

d(Q\P) + k = E P [f(Z^'l A + Z$l A c) + k] 

= Ep[l A {f{Z$) + k) + l A c{f{Z$) + «)] 

<E P [(/(z£') + «) + (/(£?) + «)] 
< d(Q|P) + <Z(Q'|P) + 2/< 

Where the first inequality is deduced from the Assumption (H2). Then 

d{Q\P) < d(Q\P) + d(Q'\P) + k<oo. 
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3- Take Z Q and Z Q> in V; We denote by A the set {w; Z Q {u) = Z Q> We have to prove 
that 

c{u,Z Q (u)) = c{uj,Z q \u)) 
on A Q— a.s and Q' — a.s □ 

4.1.2 Consistent time penalty case 

Lemma 4.3 The space T> c is compatible, stable under bifurcation and the cost function c 
is coherent. 

Proof. 

1 - V c is compatible. 

Let r, G V,t G S and rf G V c (Q^,r). Then, by definition of V c (Q^,t) we have = 

2 - Z? c is stable under bifurcation . 

Let again 77 G V c ,t J t and rf G V c (Q r, ,T). We must verify that 77" = 77|ta|t/' : = 

t]\a + 77'lyi c remains in V c . ie £7q,,» [ J h(rj")du] < +00 

E QV »[J h(r]u)du] < E Qv »[l A J h(rj u )du + l A c J h(rf u )du] 

= E P [Z?£'l A h(r] u )du + zf'l A c h(r)' u )du] 
Jo Jo 

f' T 1 f T 

= E P [Z^1 A / h(n u )du + Z^l A c / h(rf u )du] 

Jo Jo 
< Eqv [J h(rj u )du] + E QV , [J h(rj' u )du] 

Where the last inequality is deduced from the non negativity of h and the second equality 
is deduced from the definition of 7/ . 
3-The cost function c is coherent. 

Let 77 and r( in V c . Denote by A the set {u,r)(u) = r]'(uj)}. It is obvious that 

c(u,r)(u)) = c(u,ri'(u)) 
Q — a.s and Q' — a.s on A □ 
Lemma 4.4 For all r G S and Q v G Q c j , the random variable J(t,Q v ) is in L 1 (Q r '). 
Proof. By definition, we have 

J(t,Q*>) <T(t,Q*>) <E Q r,[\c(.,Q*>)\\T T }, 

which implies that 

(J(t,QV))+ <E Qv [\c(.,QV)\\T T ] 
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and so (J(T,Q' n )) + is Q^-integrable by Proposition 13.11 
It remains to show that (J(r, Q v ))~ is also Q^-integrable. 
Fix rf £ / D c {Q' n ,t). we have: 

f3E QV ,[J 5 S S S S (J h{r,' u )du)ds + S S T j h{r,' s )ds\T T ]> PE Qn ,[S 5 T j h(rj' s )ds\T T 

> e -||*||» 

Moreover, since < S" 5 < 1 and using Bayes formula, we have: 

V^rl-Tv] > -E^^IJV] = —Le p [Z{r\T t ]. 



Using the inequality (|3.10p and Bayes formula, we get: 

E p [ZJ£r\F t ] < -E P [Zj£logZ!f + e~ 1 e XR \J r T ] 
A 



Z? E QV [log^ \T T \ + — E P [e XR \T T ] 
l -Z^E Qnl [j\' u dW u -\J* MdulTr] + e -^E P [e XR \T T ] 



= jZ?'(^ V i u dW u -^^\ V '\ 2 u du + E QV ,[^ r,' u dW u -£ W\ 2 u du\T T ] 

+ E Q A\ £ Mdu\T T ]) + ^E P [e XR \T T ] 

By the Girsanov theorem the process (j Q rj' u dW u — j Q \r]'\ 2 u du) is a local Q n - martingale and 
therefore Eq v i [ rj' u dW u — \Tf\^du\^F T ] = 0. Consequently, by using that Z% = Z?, we 
have 



= \z^ T ( jf Vu dW u - i jf \ V \ 2 u du + E QV , [I £ \ V '\ldu\T T ]) + ^-E P [e XR \ T T ] 
< \Z1 ( jf - 1 \ n \ldu + E Qn , [1 ^ ^ + K2 du| Jy]) + ^E P [e AR | Jy] 

Thus, we have 

V KtIJv] > "X ( ^ VudW u -~ jf \v\ldu+E QV , [I £ kK) + K2 du\F T }) - & -l^E P [e XR \T T ], 
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and consequently 

r-T 



r(r,QV) >p e -VUT lT{Q r f) _ 1 v[ l W + ^ ^j 
= 09 e -ll*ll-r__^_ )7T(Q ^ ) 

- I(jf" - i jT \ngdu + - fl^Ep^l^]. 

Let A > such that /3 e -ll <5 H°° T - ^ = then 

T(r, Q"') > jf n u dW u \ + ± jT Hldu + ^) - JjE P [ e *V T ] := -5. 

Since the random variable i? is nonnegative and does not depend on Q' n ' , we conclude that 

J(t,Q) > -B. So that J(t,Q)- < B. 

It thus remains to be shown that B G L l (Q ri ). 

Under assumptions (A1)-(A'2), we have 

1 



E Q ^^p[e XR \F T )] = E P [Ep[e XR \T T }] = E P [e XR ] < +oo. 



Moreover 



Eqv[\ [ VudW u \ + \ I \v\ 2 u du\ < +oo. 
Jo 1 Jo 

Hence, B £ L 1 ^). □ 
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